Published online 28 December 2012 



Nucleic Acids Research, 2013, Vol. 41, No. 4 e57 

doi:10.1093/nar/gksl313 



Synthetic gene expression perturbation systems 
with rapid, tunable, single-gene specificity in yeast 

R. Scott Mclsaac 1 ' 2 '*, Benjamin L. Oakes 1 , Xin Wang 1 ' 3 , Krysta A. Dummit 1 , 
David Botstein 1 ' 3 and Marcus B. Noyes 1 '* 

1 The Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA, 2 Graduate 
Program in Quantitative and Computational Biology, Princeton University, Princeton, NJ 08544, USA and 
department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA 

Received October 8, 2012; Revised November 14, 2012; Accepted November 15, 2012 



ABSTRACT 

A general method for the dynamic control of single 
gene expression in eukaryotes, with no off-target 
effects, is a long-sought tool for molecular and 
systems biologists. We engineered two artificial 
transcription factors (ATFs) that contain Cys 2 His 2 
zinc-finger DNA-binding domains of either the 
mouse transcription factor Zif268 (9 bp of specifi- 
city) or a rationally designed array of four zinc 
fingers (12 bp of specificity). These domains were 
expressed as fusions to the human estrogen 
receptor and VP16 activation domain. The ATFs 
can rapidly induce a single gene driven by a syn- 
thetic promoter in response to introduction of an 
otherwise inert hormone with no detectable 
off-target effects. In the absence of inducer, the 
synthetic promoter is inactive and the regulated 
gene product is not detected. Following addition of 
inducer, transcripts are induced >50-fold within 
15min. We present a quantitative characterization 
of these ATFs and provide constructs for making 
their implementation straightforward. These new 
tools allow for the elucidation of regulatory 
network elements dynamically, which we demon- 
strate with a major metabolic regulator, Gcn4p. 

INTRODUCTION 

A current goal of synthetic biology is the development of 
modular tools for programming genetic circuits. 
This includes the ability to control gene expression in a 
fast, quantitative fashion without otherwise affecting 
cellular physiology. Rapid perturbation of a protein's 
abundance followed by monitoring of the genome-wide 



transcriptional response has previously been used to 
dissect the architecture of complex regulatory networks 
in vivo (1,2). This kinetic information can directly inform 
quantitative models of gene regulation and elucidate novel 
design principles. 

In yeast, the most commonly used expression systems 
rely on nutritional perturbations such as the addition of 
copper, sugar or methionine to modulate levels of a target 
gene's expression (3,4). These nutritional perturbations 
can be quite severe. For example, a gene whose expression 
is driven by the MET25 promoter can be selectively 
repressed, but this requires the addition of 1 mM extracel- 
lular methionine (5). For overexpression of a target gene, 
the transcription factor (TF) Gal4p is commonly used. 
A target gene placed under the control of a promoter con- 
taining Gal4p recognition sequences (UAS G al) can be 
selectively induced in the presence of galactose (5). Since 
Gal4p activity is attenuated in glucose, this approach 
requires growing cells on alternative carbon sources like 
raffinose or glycerol. 

Improvements to expression systems using otherwise 
inert inducers such as the tetracycline analog doxycycline 
or the mammalian hormone (3-estradiol have reduced the 
perturbations of the physiological cell state due to intro- 
duction of the inducer (6-8). Although these previous 
systems have provided useful tools, demonstration of an 
expression system that satisfies the criteria of being (i) 
fast-acting, (ii) tightly regulated, (iii) nearly gratuitous 
(i.e. relatively few off-target effects) and (iv) graded over 
a range of inducer concentrations was only recently pub- 
lished for yeast (7). 

Here, we provide a full characterization of a 
next-generation (3-estradiol inducible expression system 
for yeast that displays true single-gene precision today as 
well as the potential for multiplex regulation in the future. 
This system utilizes a constitutively expressed artificial 
transcription factor (ATF) from the ACT1 promoter. 
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The ATF consists of a DNA-binding domain (DBD), the 
human estrogen receptor (ER) and the VP 16 activation 
domain (Figure 1). In the absence of inducer, the 
ER interacts with the Hsp90 chaperone complex, seques- 
tering the ATF to the cytoplasm (9). Introduction of 
(3-estradiol displaces Hsp90, revealing a nuclear localiza- 
tion signal, and the ATF translocates to the nucleus 
(Figure 1). The ATF as described provides a strong 
transcriptional activator that is dependent on the 
presence of [3-estradiol. By using synthetic DBDs that 
bind only a cognate DNA sequence in place of the 
Gal4p DBD, residual off-target effects have been com- 
pletely eliminated. 

Previously, the DBD from the yeast transcriptional 
activator Gal4p was used in the chimeric activator 
Gal4dbd.ER.VP16 (GEV) (7,10). Yet, as a DBD from a 
yeast TF, and a factor with only 6 bp of specificity, this 
DBD has many potential off-target sites in the yeast 
genome. DBDs of bacterial TFs (e.g. LacI and tetR) 
have been repurposed for use in the development of 
gene expression systems that are orthogonal to the 
native eukaryotic regulatory machinery. However, as dis- 
cussed elsewhere (11), bacterial DBDs have numerous 
limitations (oligomerization issues, cooperative binding, 
etc.) that restrict their use for engineering novel TFs for 
use in eukaryotes. 



We constructed a set of ATFs that utilize the modular 
and designable Cys 2 His 2 zinc-finger DBD. A single 
zinc-finger domain contains ~30 amino acids and will 
offer three to four bases of specificity, while multiple 
domains can be assembled as zinc-finger arrays where 
each additional finger adds three bases of specificity [re- 
viewed in (12)]. This domain has previously shown to be 
engineer-able to take on novel specificity and function in a 
somewhat modular fashion while high-quality zinc-finger 
arrays have been selected by multiple methods (13-15). 
Through decades of research some basic rules of specificity 
have been gleaned and specificity can sometimes be 
designed (16,17). ATFs have previously been constructed 
by fusing a designed Cys 2 His 2 zinc-finger DBD to a 
hormone-responsive domain and transcriptional activa- 
tion domain and were functional in mammalian cells 
(18,19). A repressive ATF utilizing an engineered 
Cys 2 His 2 zinc-finger DBD has been reported to achieve 
single-target precision (also in mammalian cells) (20). 

In this study, we replaced the Gal4dbd of GEV with 
either the three-fingered DBD of the mouse TF Zif268 
or a rationally designed four-fingered zinc-finger array 
(Z 4 ). We refer to these constructs as Z 3 EV and Z 4 EV, re- 
spectively. The design of the Z 4 array leaned heavily on 
previously defined design principles and targets that have 
been shown to be most effective (i.e. GNN fingers). The Z 4 




Figure 1. Schematic of hormone-based gene expression system. ATFs contain a DNA-binding zinc-finger array, the ligand binding domain of the human 
estrogen receptor and the VP16 activation domain. In the presence of P-estradiol (1), ATFs dissociate from Hsp90 (2), translocate to the nucleus (3) and 
activate transcription of a gene of interest (GOI) (4). Once produced (5) the gene products can be detected using a variety of methods. 
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array was designed to target not only a sequence that is 
not found in the yeast genome but also a sequence that 
accommodated a likely to be successful assembly of 
GNN-binding zinc fingers. 

We demonstrate that the ATFs Z 3 EV and Z 4 EV are fast- 
acting and achieve a graded output response over a range of 
inducer concentrations, as did the GEV system. Using gene 
expression microarrays, we find that while GEV activation 
results in the induction or repression of several hundred 
genes within 30min, these new ATFs have no significant 
off-target expression response. As a result, these ATFs 
eliminate the growth defect incurred by strong induction 
of GEV. Finally, we demonstrate the power of these new 
expression systems by inducing GCN4 and following the 
genome-wide expression responses dynamically. 



MATERIALS AND METHODS 

Media and growth conditions 

For batch culture experiments, cells were grown in either 
synthetic complete medium lacking uracil or YPD (1% 
yeast extract, 2% bacto-peptone and 2% dextrose), 
unless noted otherwise. Induction of GEV, Z 3 EV or 
Z 4 EV by P-estradiol (Tocris Biosciences, Ellisville, MO, 
USA) was performed in cells during log-phase growth 
(culture absorbance = 50-100 Klett units). Chemostat 
cultures were maintained in phosphate-limited growth 
medium (20mg/l potassium phosphate). When needed to 
complement an auxotrophy, phosphate-limited media was 
supplemented with excess L-Leucine (200mg/l). The com- 
ponents of chemostat medium are carefully prepared to 
mimic the composition of standard yeast nitrogen base 
medium and have been detailed previously (21). 

Strain and plasmid construction 

Transformations were performed with a standard lithium 
acetate method. To construct Z 3 EV and Z 4 EV strains, the 
Gal4dbd of GEV was deleted by PCR-mediated disruption 
with URA3. Zif268 and Z 4 DNA-binding domains with 
homology to the ER and ACT1 promoter were then trans- 
formed into cells and selected via 5-Fluoroorotic Acid (5- 
FOA) counter selection of URA3. Single colonies were 
isolated and sequenced to verify the presence of the proper 
DNA-binding domain (Supplementary Figure SI A). 

The reporter plasmid was created from the base plasmid 
pRS416 (gift from Megan McClean), a CEN plasmid con- 
taining the URA3 selectable marker. The GAL1 promoter 
region was amplified from genomic DNA. Overlap- 
extension PCR was used to add the restriction enzyme 
sites for Xbal and NotI, respectively, on either side of 
the region of the three continuous Gal4p-binding sites 
5 / -CGG-Nn-CCG-3 / (22). This promoter fragment was 
then cloned into pRS416 in front of green fluorescent 
protein (GFP) using the restriction enzymes Nhel and 
Xmal. Before zinc-finger-binding sites were added the 
Xbal site in GFP was re-coded using a silent mutation 
to remove this restriction site. Finally, the three canonical 
Gal4p-binding sites were removed via digestion with Xbal 
and NotI and triplets of dimeric Z 3 EV and Z 4 EV-binding 



sites were cloned into respective plasmids (Supplementary 
Data and Supplementary Figure SIB). 

To construct the inducible GCN4 allele, KanMX- 
Z 4 EVpr was amplified from pMNIO with the primers 
S'-caatttgtctgctcaagaaaataaattaaatacaaataaaCGCACTTA 
ACTTCGCATCTG-3' and 5'-tggatttaaagcaaataaacttggc 
tgatattcggacatTATAGTTTTTTCTCCTTGACG-3 / and 
transformed into Z 4 EV-containing parent strain. The 
uppercase portions of the sequences share homology 
with the KanMX-Z 4 EVpr cassette on the plasmid 
pMNIO. 

RNA extraction, labeling and hybridization 

RNA extraction, labeling and hybridization were per- 
formed as described previously (7). Five milliliters of 
cells were harvested from chemostat cultures by vacuum 
filtration onto 0.45-(im nylon membranes (Millipore, 
HNWP02500) and flash frozen in liquid nitrogen. Crude 
RNA was extracted with a standard acid-phenol method 
and subsequently cleaned with RNeasy (QIAGEN, 
Valencia, CA, USA). Cleaned RNA was labeled using 
the Agilent Quick- Amp Labeling Kit (#5190-0447). 
Reference RNA was extracted from a laboratory 
wild-type strain (DBY 12001) grown to steady state in 
phosphate-limited growth medium with a doubling time 
of 3.9 h. Microarrays were hybridized for 17 h at 65°C on 
a rotisserie at 20rpm. Hybridized microarrays were 
washed, scanned and raw data were extracted with 
Agilent Feature Extraction Software version 9.5. 

Microarray analysis 

Sample and reference channel intensities were first floored 
to a value of 350 (23,24). Once log 2 ratios were computed 
between samples and reference, the data were time-zero 
transformed. Genes that were flagged by Agilent Feature 
Extraction Software or equal to 0 at every time point were 
removed in R (25). Data were hierarchically clustered in 
the Cluster 3.0 software package (26) with average linkage 
using the Pearson correlation distance as the metric of 
similarity between genes (27). 7^-means clustering and 
figures of merit analysis were carried out with the MeV 
software suite (28,29). Enrichment of TF binding to pro- 
moters in each cluster was determined with Fisher's exact 
test. P-values were Bonferroni-corrected to account for 
multiple comparisons. 

Quantitative real-time PCR 

RNA was converted to cDNA with MultiScribe™ 
Reverse Transciptase using random hexamer priming 
(Applied Biosy stems). Quantitative real-time PCR 
(qRT-PCR) of lOOng cDNA was performed in triplicate 
with SYBR® Green PCR Master Mix on an ABI 7900HT 
series PCR machine. PCR conditions were lOmin at 95°C 
(one cycle) followed by 15 s at 95°C and 1 min at 60°C (40 
cycles). Analysis was performed using SDS2.3 software. 
C T values were computed using the standard curve 
method. GFP expression levels were normalized to 
TDH2 expression. Primers for amplification from 
GFP were 5 / -TTTCTGTCTCCGGTGAAGGT-3 / and 
5 / -GACTAAGGTTGGCCATGGAA-3 / . Primers for 
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amplification from the housekeeping gene TDH2 were 
5 / -TGACTCCACTGGTGTTTTCAAG-3 / and 5'-ACCT 
TCTTCAATACCGAAAGCA-3 / . 

Flow cytometry 

Approximately 10 7 cells were harvested by centrifugation 
and subsequently washed and resuspended in 
DPBS + 0.1% Tween-20. Measurements of GFP fluores- 
cence were performed with a BD LSRII Multi-Laser 
Analyzer with HTS (BD Biosciences, Sparks, MD, 
USA). Mean fluorescence values were determined from 
at least 50 000 cells. 

In Figure 3, dose response curves were fit to a Hill 
function of the form G(D) = G min +(G max - G min ) ^^r, 
where G is the level of GFP, D is the amount of 
(3-estradiol, n is the Hill coefficient and K is the dose 
yielding half of (G max + G mm ). G min and G max are con- 
stants that are the minimum and maximum mean GFP 
levels from the data. Data were fit using least-squares re- 
gression with the cftool function in MATLAB 
(MathWorks, Natick, MA, USA). 

Zinc-finger array design 

The Z 4 array was designed to bind a sequence that does 
not exist in the yeast genome while keeping design prin- 
ciples biased toward GNN sequences that have been 
shown to be more functional and modular (30,31). The 
zinc fingers were designed with four primary contacts 
that have been described: Arg's at position —1 and 6 of 
each recognition helix to offer G specificity, Asp at 
position 3 of the helix to offer C specificity and Asn at 
position 3 to offer A specificity. To avoid high affinity that 
may be offered by cross-strand contacts, and therefore 
bias binding by a subset of zinc fingers as opposed to 
the whole four-fingered array, small amino acids such as 
Ala and Gly were employed at position 2 of each helix. At 
the more flexible positions 1 and 5, residues were either 
chosen because of size or charge. Furthermore, as a four- 
fingered protein, the backbone of one finger of the three- 
fingered Zif268 was duplicated and recoded. We chose to 
repeat finger 2 and extend the linker between the second 
and third fingers (TGSQKP) as has been described (32). 

The final assembly used the backbones of Zif268 fingers 
1-2-2-3. As a result, the modest design principles noted 
above were used to design the recognition helices on each 



of the four fingers, while the backbone residues were 
copied from the Zif268 fingers in a finger 1-2-2-3 
pattern (the exact coding sequence can be found in the 
Supplementary Data). Similar finger helices to those 
designed here have been shown to be functional on these 
targets when engineered with the OPEN system, giving us 
confidence that this would be a functional finger array 
with the desired specificity (31). 

Bacterial one-hybrid assay 

Characterization of the DNA-binding specificity of the Z 4 
protein was performed by bacterial one-hybrid assay as 
previously described (33,34). The protein was assembled 
by overlapping PCR using the Zif268 scaffold as a 
template and cloned into the pBlH2w2 plasmid (34). 
Using this system, functional binding sites are found by 
selecting for sequences that are able to activate the HIS3 
reporter gene in bacteria from a 28-bp library of 
randomized DNA sequences upstream of the promoter 
(Supplementary Figure S2). Functional binding sites 
were recovered by sequencing this promoter region from 
surviving clones (see Supplementary Data for sequences) 
and analyzed by MEME to discover an over-represented 
motif that represents the DNA-binding specificity of the 
protein (Supplementary Figure S2) (33). 



RESULTS 

Identifying non-yeast DNA-binding domains with few 
potential genomic-binding sites 

In order to produce a synthetic system that has no 
off-target induction, we sought synthetic target sequences 
extremely rare or absent from the yeast genome. The 
Gal4p DNA-binding domain of GEV dimerizes and 
binds to the sequence 5 / -CGG-Nn-GCC-3 / (Figure 2). 
Scanning the Saccharomyces cere vis iae genome for in- 
stances of this motif, we identified more than 500 potential 
binding sites. The canonical motif recognized by Zif268, 
the zinc-finger domain used in Z 3 EV, is GCGTGGGCG 
(Figure 2), for which we only find 11 predicted binding 
sites. The motif recognized by the rationally designed 
zinc-finger array used in Z 4 EV is GCGGCGGAGGAG 
(Figure 2). The Z 4 zinc-finger array was specifically 
designed to target this sequence because it is not present 
in the yeast genome. While the motifs for Gal4p and 
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Figure 2. (A) The structure of the ATFs Z 4 EV, GEV and Z 3 EV. (B) The DNA-binding motifs of Gal4p, Zif268 and the Z 4 array. 
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450 p 




([p-Estradiol]) nM 



Figure 3. Dose response curves. Strains yMN5 (GEV), yMN7 (Z 3 EV) 
and yMN14 (Z 4 EV) were grown to early log-phase and then different 
amounts of P-estradiol were added to the growth medium. GFP was 
quantified following 12 h of induction by flow cytometry. Error bars 
represent ± 1 SD from three independent cultures. 



Zif268 have been published previously, we confirmed the 
DNA specificity of the Z 4 array with a bacterial one- 
hybrid system (33,34). 

Measuring transcriptional output 

To characterize the responsiveness of Z 3 EV and Z 4 EV, we 
constructed synthetic promoters based on the zinc-finger 
DNA-binding specificity (see Supplementary Data for 
sequences). Each promoter mimics the GEV promoter, 
containing six monomeric binding sites, which allows for 
three dimerizations of the ER as one would see in the 
GEV system. These promoters are referred to as Z 3 EVpr 
and Z 4 EVpr, respectively. Gal4p recognition sequences 
used in the previous GEV system were replaced with 
either the Z 3 EV or Z 4 EV targeting sequences within the 
GAL1 promoter (Supplementary Data and Supplementary 
Figure SIB). Control experiments confirmed that the 
correct combination of DBD and promoter, as well as 
introduction of P-estradiol, were required for activation 
of a target gene (Supplementary Figure S3). When the 
UASgal sequences are removed, Gal4p can no longer 
induce transcription (Supplementary Figure S4). 
Furthermore, with the appropriate DBD-promoter com- 
binations, the dose response of each construct was 
characterized (Figure 3). No induction of GFP was 
observed for any ATF at concentrations of P-estradiol 
<lnM. Interestingly, a similar amount of GFP intensity 
is achieved with ~ 10-fold less induction of Z 3 EV or Z 4 EV 
in comparison to GEV (Figure 3). We determined that 
the Hill coefficient for each ATF-promoter pair is M, 
consistent with non-cooperative activation of 
transcription. 

The effect of ATF induction on gene expression 

To assess off-target expression directly, we measured 
genome-wide gene expression of cells grown to steady 
state in chemostats containing GEV, Z 3 EV or Z 4 EV and 
cognate reporter constructs (Figure 4A), and assessed 
global gene expression 0, 0.5 and 3h after addition of 



1 uM P-estradiol to the steady-state cultures. In response 
to GEV induction, 155 genes are induced >2-fold and 263 
genes are repressed >2-fold 3h following P-estradiol 
addition. In contrast only one gene is induced >2-fold 
in response to Z 3 EV activation at 3h. This gene is 
YOR343C, which is a dubious open reading frame 
(ORF) unlikely to encode a protein according to the 
Sac char omyces Genome Database (35). Similarly, zero 
genes are induced > 2-fold in response to Z 4 EV activation 
3h following P-estradiol addition; however, two genes, 
RPS8A (a ribosomal protein) and YDR133C (a dubious 
ORF), were induced 2-fold in the Z 4 EV time course at 
30min, but returned to 0-fold at 3h. 

Each of the chimeric activators induced a GFP reporter 
over the course of the experiment. Cells were visualized by 
microscopy to confirm GFP expression (data not shown), 
and the level of GFP transcript was measured by qRT- 
PCR, revealing that each of the chimeric activators 
induced GFP mRNA > 50-fold by 15min following 
addition of P-estradiol (Figure 4B). The maximum level 
of induction for the chimeric activators ranges from 
-100- to 300-fold. 

The effect of ATF induction on cell growth 

We next examined whether these differences in gene 
expression might be related to the growth defect we pre- 
viously reported to be conferred by the GEV system. We 
compared growth between Z 4 EV-, GEV- and Z 3 EV- 
containing strains in the presence of different amounts 
of P-estradiol (Figure 5A). Between 10 nM and 10 uM 
P-estradiol, the growth rate of GEV-containing cells 
decreased —70% in rich medium (Figure 5B). In 
contrast, induction of either Z 3 EV or Z 4 EV with 
10nM-10 uM P-estradiol has no effect on growth 
(Figure 5B). 

Tuning the transcriptional output of Z 3 EV 

The above results demonstrate the precision with which 
the Z 3 EV or Z 4 EV systems are able to activate a single 
target. However, they do little to indicate how sensitive the 
system is to the strength of the protein-DNA interaction. 
To this end, we tested the Z 3 EV system against four 
targets with known differences in binding affinity in com- 
parison to the consensus target of Zif268 (36). Within a 
9-fold range of dissociation constants, relative to the con- 
sensus sequence, activation of GFP is correlated to the 
strength of the protein-DNA interaction (Figure 6). 
Furthermore, a binding-site that offers a 20-fold reduction 
in affinity is not above background after 6 h of induction. 
These results illustrate that our system is tunable and is 
unlikely to function on binding sites that offer > 20-fold 
reduction of binding affinity. Future research might inves- 
tigate the perturbation of networks with finely tuned levels 
of transcription based on the use of one of these or other 
characterized binding sites. 

Using Z 4 EV to probe the Gcn4p transcriptional 
regulatory network 

To test whether this system is able to provide single-gene 
precision on a genomic target, we decided to target GCN4. 
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Figure 4. (A) Hierarchically clustered gene expression responses of continuous cultures containing GEV, Z 3 EV or Z 4 EV upon addition of 1 fiM 
[3-estradiol (data are time-zero normalized in each experiment). Time points in each experiment are 0, 0.5 and 3h. Each culture was maintained at a 
doubling time of 4.07 h. (B) Induction of GFP reporters in cultures from (A) measured by qRT-PCR. Error bars represent ± 1 SEM of three 
technical replicates. Strains are yMN5 (GEV), yMN7 (Z 3 EV) and yMN14 (Z 4 EV). 




To simplify constructing inducible genomic alleles, we 
created plasmids (pMN9 and pMNIO) with the KanMX 
cassette, which confers resistance to G418, fused to either 
Z 3 EVpr or Z 4 EVpr (template sequences can be found in 
the Supplementary Data). These synthetic promoters can 
be inserted into the genome by homologous recombin- 
ation to make an allele that is conditionally expressed in 
the presence of (3-estradiol (Figure 7A). Gcn4p is a strong 



transcriptional activator of a wide range of enzymes 
required for the production of amino acids 
(Supplementary Figures S5-S14). Cells lacking GCN4 
grow slowly in minimal medium (37), a phenotype we 
were able to recapitulate in a strain in which GCN4 is 
under the control of KanMX-Z 4 EVpr (Figure 7B). This 
growth defect is fully alleviated by adding 5nM 
(3-estradiol (Figure 7B). 
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Figure 6. Induction of GFP reporters driven by promoters of different 
affinity by Z 3 EV following addition of 1 uM |3-estradiol. Error bars 
represent ± 1 SD of three independent cultures. 



GCN4 is rapidly induced by Z 4 EV in response to 1 jiM 
(3-estradiol (Figure 7C) and in continuous cultures we were 
able to quantify the dynamic transcriptional impact of 
GCN4 expression on cells. By 2h, induction of GCN4 
resulted in >2-fold induction of 327 genes (~5% of the 
genome; Figure 7D and Supplementary Dataset SI), 
which are strongly enriched for cellular amino acid bio- 
synthesis processes (corrected P-value = 1.81 x 10 -50 ). 
There are 255 genes repressed > 2-fold. These genes are 
enriched for processes involved with cytoplasmic transla- 
tion (corrected P-value = 1.41 x 10 -11 ), ATP synthesis 
coupled proton transport (corrected P-value = 6.1 x 10 -4 ) 
and nucleotide biosynthesis (corrected P-value 
= 3 x 10 -3 ) (Figure 7D and Supplementary Dataset SI). 
The presence of Gcn4p binding based on previously 
published ChlP-Chip data (38) is noted next to the 
hierarchically-clustered gene expression data 
(Figure 7D), and clearly correlates with induced expres- 
sion. Based on ChlP-Chip, of the induced genes, 116 have 
previously been shown to be direct Gcn4p targets. Only 
five of the repressed genes (DLD3, ICY1, MET 3, RFA3 
and TIM 10) (Supplementary Figure SI 5) have been 
shown to be direct targets. Repression is typically 
mediated at late time points, consistent with an indirect 
response to Gcn4p production. 

To perform an unbiased search for enriched regulatory 
motifs corresponding to particular expression responses 
(determined via iC-means clustering) we used the 
FIRE algorithm (39). The number of iC-means clusters 
was determined using figures of merit analysis 
(Supplementary Figure SI 6). Two clusters were identified 
as having been strongly induced by Gcn4p (clusters 1 and 
2; Figure 7E). Based on FIRE, promoters of genes in these 
clusters are enriched for the presence of Gcn4p-binding 
sequences (Figure 7F; z-scores = 123.7 and 27.1, respect- 
ively). Furthermore, cluster 1 genes are enriched for 



Gcn4p, Rtg3p, Yap7p, Gln3p and Leu3p binding based 
on ChlP-Chip (corrected P- values = 1.47 x 10~ 55 , 
4.77x 10" 5 , 0.006, 0.031 and 0.06, respectively; Fisher's 
Exact Test). Cluster 2 genes are enriched only for Gcn4p 
binding (corrected P-value = 1.50 x 10 -30 ; Fisher's Exact 
Test). The remaining clusters show no enrichment for 
Gcn4p binding. 

These results, in combination with those detailed above, 
demonstrate the utility of these expression systems for dis- 
secting complex regulatory networks in vivo. They offer 
the ability to maintain precise levels of an individual 
gene's expression. We plan in the future to adapt other 
DBDs, target sequences or hormone receptors to make it 
possible to control several inputs simultaneously or in 
sequence with the same level of precision. 



DISCUSSION 

The synthetic biology and broader experimental 
communities have been hampered by a lack of quantita- 
tive expression systems for both engineering novel and 
probing native regulatory circuitry in vivo. In this article, 
we developed quantitative expression systems for yeast 
that address this problem. These expression systems 
achieve single-gene specificity: in an appropriately 
modified strain a single gene of interest can be selectively 
activated in an inducer-dependent fashion. Output expres- 
sion can be tuned to different levels by either changing the 
amount of inducer or the identity of the binding sites 
upstream of a target gene. We have taken advantage of 
the functionality of hormone receptors as effective 
post-translational switches in S. cerevisiae. With this 
methodology, we anticipate that by utilizing multiple 
hormone receptors with orthogonal-binding domains a 
combinatorial library of tightly-regulated, inducible 
expression systems can be used to design and program 
more sophisticated regulatory functions. These may also 
be combined with IPTG- and doxycycline-based expres- 
sion systems, the most recent of which have been 
characterized in (11). 

Here, we have used the Cys 2 His 2 zinc-finger DBD to 
provide the necessary affinity and specificity to 
site-specifically activate the target of interest. However, 
transcription activation-like (TAL) effector domains 
have emerged as promising tools for engineering 
DBDs in eukaryotes. TAL effectors are virulence factors 
utilized by pathogenic bacteria in the genus Xanthomonas 
(40). Specificity is determined by repeat variable 
di-residues (RVDs) at positions 12 and 13 while RVD 
regions that recognize each of the four nucleotides have 
been identified (41,42). Their modularity and relative sim- 
plicity suggests that TAL domains can be stitched together 
to form novel DNA-binding domains that recognize any 
DNA sequence of interest. Synthetic TAL effectors have 
been constructed that function as activators or repressors 
of eukaryotic transcription (43). Therefore, if applications 
would benefit from the use of multiple inducers used sim- 
ultaneously, a conversion to TAL-hormone receptor 
system might be beneficial. Still, limitations in the TAL 
assembly may result in the continued use of zinc fingers 
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Figure 7. (A) Schematic of strains used for inducing genomic GCN4 allele. Z 4 EV is driven by the ACT1 promoter. In the presence of |3-estradiol, 
Z 4 EV activates transcription of the native GCN4 allele driven by the synthetic promoter Z 4 EVpr. KanMX is expressed in the reverse orientation. 
(B) Measuring growth Z 4 EV-containing haploids containing GCN4 or Z 4 EVpr-GC7V¥ in the presence or absence of 5nM P-estradiol. Cells were 
grown in low phosphate chemostat medium. (C) The transcriptional response of Z 4 EVpr-GC7W (strain = DBY12423) in response to 1 jiM P-estradiol 
in a continuous controlled maintained under phosphate limitation. The culture was maintained at a doubling time of 6.3 h. (D) The global tran- 
scriptional response of cells in (C) measured out to 2h. We denote the presence of Gcn4p binding to a promoter by a blue line [(38) P-value <0.001]. 
(E) Expression data from (D) were divided into five expression clusters using A^-means clustering with the Euclidean distance similarity metric. The 
optimal number of clusters over-represented and under-represented motifs within these clusters was determined using the FIRE algorithm (F). 



(size, repetitive coding sequence, etc.). Moreover, as we 
have demonstrated in this manuscript, extensive biochem- 
ical characterization such as knowledge of Cys 2 His 2 
DNA-binding affinity can be used to predictably tune 
output expression of distinct targeting sequences. The 
wealth of literature on the zinc-finger protein-DNA inter- 
action may prove advantageous toward this end. 

Different inputs can be used to stimulate a network 
and reveal its relevant components and architecture. 



A 'transfer function' relates a prescribed input to a 
measured output. In the described case, we provided a 
step increase of Gcn4p to identify and quantify the re- 
sponsive outputs of Gcn4p activation. It is, however, 
often desirable to provide alternative stimuli. For 
example, to determine the bandwidth of a signaling 
pathway, the pathway input can be varied at different 
frequencies and the output can be measured using 
fluorescently tagged reporters. This approach has been 
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used previously to show that the Hoglp MAPK-signaling 
cascade acts as a low-pass filter in response to fluctuations 
in extracellular salt concentration and quantify its 
bandwidth (44). Future work may investigate how to 
directly provide cells with varying amounts of protein at 
desired frequencies. While varying extracellular concentra- 
tions of stimulus is relatively straightforward with 
microfluidic-based approaches, varying intracellular 
levels of protein over time presents a significant technical 
challenge. One solution may be the implementation of 
light-responsive expression systems, which can induce 
expression of a target gene in the presence of a particular 
frequency of light, and ablate expression in its 
absence (45). 

We previously demonstrated that by inducing the TEV 
protease with GEV, we could rapidly degrade an 
7V-degron-tagged protein with a half-life of <15min (7). 
By combining two orthogonal induction systems (one for 
inducing an 7V-degron-tagged allele and a second for 
inducing TEV) one could provide cells with different size 
and/or length protein pulses. In response to the first 
inducer the protein would be synthesized, and in 
response to the second inducer, the protein would be des- 
troyed by the protease. This approach could also utilize 
the TIR1 ubiquitin ligase from Arabidopsis thaliana, which 
is post-translationally induced by the auxin indole-3-acetic 
acid and is functional in yeast (46). This may be advanta- 
geous because TIR1 can be expressed constitutively and 
induced by auxin without having to be first transcribed 
and translated as in the case of TEV. 

In conclusion, synthetic approaches can aid in the 
understanding of native regulatory machinery and 
provide the toolkit to design and implement rationally 
designed genetic programs. We demonstrated the utility 
of a perturbative approach in quantifying the downstream 
effects of a major transcriptional regulator of amino acid 
biosynthetic genes. More broadly our work provides a 
new set of expression systems for a variety of experimental 
and potentially industrial applications. These systems 
serve as a flexible platform that can be used to tune, inte- 
grate and re-purpose eukaryote functions. 
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